Picture for Baolin Peng

Baolin Peng

EJ

OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents

Add code
Jun 01, 2026
Viaarxiv icon

Latent Recurrent Transformer: Architecture Exploration, Training Strategies, and Scaling Behavior

Add code
May 26, 2026
Viaarxiv icon

Orchard: An Open-Source Agentic Modeling Framework

Add code
May 14, 2026
Viaarxiv icon

WebXSkill: Skill Learning for Autonomous Web Agents

Add code
Apr 14, 2026
Viaarxiv icon

The Tool Illusion: Rethinking Tool Use in Web Agents

Add code
Apr 03, 2026
Viaarxiv icon

Reinforcement World Model Learning for LLM-based Agents

Add code
Feb 05, 2026
Viaarxiv icon

Adapting Web Agents with Synthetic Supervision

Add code
Nov 08, 2025
Viaarxiv icon

Dyna-Mind: Learning to Simulate from Experience for Better AI Agents

Add code
Oct 10, 2025
Figure 1 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 2 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 3 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 4 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Viaarxiv icon

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Add code
Jul 09, 2025
Figure 1 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 2 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 3 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 4 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Viaarxiv icon

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Add code
Apr 30, 2025
Figure 1 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 2 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 3 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 4 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Viaarxiv icon